3574 results found.
Multimodal/Multimedia
Dataset of names for objects in images,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC-BY-4.0
Size:
25000 entries
Production Status:
Newly created-finished
Use:
Linguistic and Model Analysis of Object Naming
- Paper title: Humans Meet Models on Object Naming: A New Dataset and Analysis
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Carina Silberer | ManyNames v2 | /N |
Documentation:
None
Transcribed speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
GNU
Size:
3135 Argument Discourse Units Other
Production Status:
Existing-used
Use:
Dialogue
- Paper title: Contextual Argument Component Classification for Class Discussions
- Paper track: Short paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Luca Lugini | Discussion Tracker | /N |
Documentation:
http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.130.pdf
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
- Paper title: Evaluating Unsupervised Representation Learning for Detecting Stances of Fake News
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Matthias Aßenmacher | FNC-1 | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons
Size:
25,000 dialogues Other
Production Status:
Existing-used
Use:
Dialogue
- Paper title: A Taxonomy of Empathetic Response Intents in Human Social Conversations
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Anuradha Welivita | EmpatheticDialogues | /N |
Documentation:
Documentation in English is publicly available.
Written
Language Modeling Tool,
Language Type:
Multilingual
Languages:
English Hebrew Italian Russian
Availability:
Freely Available
License:
Creative Commons
Size:
20 MByte
Production Status:
Existing-used
Use:
Language Modelling
- Paper title: Priorless Recurrent Networks Learn Curiously
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jeff Mitchell | colorlessgreenRNNs | /N |
Documentation:
ReadMe in English
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
MIT
Size:
500 KByte
Production Status:
Newly created-finished
Use:
Machine Learning
- Paper title: Automatic Charge Identification from Facts: A Few Sentence-Level Charge Annotations is All You Need
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shounak Paul | Charge Identification Dataset for Indian Law | /N |
Documentation:
English
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German Romanian
Availability:
Freely Available
License:
Size:
None GByte
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Filtering Back-Translated Data in Unsupervised Neural Machine Translation
- Paper track: Short paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jyotsana Khatri | WMT data | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English Italian
Availability:
Freely Available
License:
CC BY-NC-SA 3.0
Size:
120000 tokens
Production Status:
Newly created-finished
Use:
Parsing and Tagging
- Paper title: Exploring the Language of Data
- Paper track: Long paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Gábor Bella | The Language of Data Annotated Corpus v1 | /N |
Documentation:
None
Written
Treebank,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
11.9 MByte
Production Status:
Existing-used
Use:
Discourse
- Paper title: Interactively-Propagative Attention Learning for Implicit Discourse Relation Recognition
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yu Hong | PDTB | /N |
Documentation:
An annotation guideline written in English exists; it may be publicly available.
Written
Corpus,
Language Type:
Multilingual
Languages:
English French Romanian
Availability:
Freely Available
License:
N/A
Size:
2,000,000 sentences
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Emergent Communication Pretraining for Few-Shot Machine Translation
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yaoyiran Li | Europarl | /N |
Documentation:
None




